Are open set classification methods effective on large-scale datasets?
Similar resources
FilterBoost: Regression and Classification on Large Datasets
We study boosting in the filtering setting, where the booster draws examples from an oracle instead of using a fixed training set and so may train efficiently on very large datasets. Our algorithm FilterBoost, which is based on a logistic regression technique proposed by Collins et al. (2002), requires fewer assumptions to achieve bounds equivalent to or better than previous work. Our proofs de...
Effective Distribution of Large Scale Datasets Clustering Based on Map Reduce
Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate. Challenges include analysis, capture, data curation, search, sharing, storage, transfer, visualization, querying, and information privacy. The term often refers simply to the use of predictive analytics or certain other advanced methods to extract value from data, and seldom to ...
Simple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
With the growing amount of published research, automatic evaluation of scholarly publications is becoming an important task. In this paper we address this problem and present a simple and transparent approach for evaluating the importance of scholarly publications. Our method has been ranked among the top performers in the WSDM Cup 2016 Challenge. The first part of this paper describes our meth...
Large scale statistical analysis of GEO datasets
The problem addressed here is that of simultaneous treatment of several gene expression datasets, possibly collected under different experimental conditions and/or platforms. Using robust statistics, a large scale statistical analysis has been conducted over 20 datasets downloaded from the Gene Expression Omnibus repository. The differences between datasets are compared to the variability insid...
Cross-Validation Optimization for Large Scale Structured Classification Kernel Methods
We propose a highly efficient framework for penalized likelihood kernel methods applied to multiclass models with a large, structured set of classes. As opposed to many previous approaches which try to decompose the fitting problem into many smaller ones, we focus on a Newton optimization of the complete model, making use of model structure and linear conjugate gradients in order to approximate...
Journal
Journal title: PLOS ONE
Year: 2020
ISSN: 1932-6203
DOI: 10.1371/journal.pone.0238302